Skip to content

Conversation

netrunnereve
Copy link
Collaborator

The Ubuntu Vulkan SDK package is being discontinued, so I switched that to the generic one. I also made a couple changes to make it run faster (new llvmpipe and FP32 tests as that's easier for the CPU to run).

@netrunnereve netrunnereve requested a review from CISC as a code owner September 27, 2025 21:52
@github-actions github-actions bot added the devops improvements to build systems and github actions label Sep 27, 2025
@CISC CISC requested a review from 0cc4m September 28, 2025 08:38
Copy link
Collaborator

@CISC CISC left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please cache the SDK similarly to #15887

@netrunnereve
Copy link
Collaborator Author

Ok this should do it. Turns out it only saves like 30 seconds of runtime.

@CISC
Copy link
Collaborator

CISC commented Oct 1, 2025

Ok this should do it. Turns out it only saves like 30 seconds of runtime.

It will mainly prevent spamming external sources, and avoiding outages etc.

@netrunnereve netrunnereve marked this pull request as draft October 2, 2025 00:55
@github-actions github-actions bot added documentation Improvements or additions to documentation build Compilation issues script Script related testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend examples python python script changes server ggml changes relating to the ggml tensor library for machine learning Apple Metal https://en.wikipedia.org/wiki/Metal_(API) OpenCL Issues specific to the OpenCL backend labels Oct 2, 2025
@netrunnereve netrunnereve removed documentation Improvements or additions to documentation build Compilation issues script Script related testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend examples python python script changes server ggml changes relating to the ggml tensor library for machine learning Apple Metal https://en.wikipedia.org/wiki/Metal_(API) OpenCL Issues specific to the OpenCL backend labels Oct 2, 2025
@netrunnereve netrunnereve marked this pull request as ready for review October 2, 2025 01:03
@ggerganov
Copy link
Member

Good to merge?

@0cc4m 0cc4m merged commit f09aefa into ggml-org:master Oct 2, 2025
54 of 57 checks passed
@netrunnereve netrunnereve deleted the vulkan_ci branch October 2, 2025 14:49
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Oct 3, 2025
* origin/master: (124 commits)
metal : fix loop bound in ggml_mem_ranges (ggml-org#16412)
llama : fix shapes for bert/mpt q/k norm (ggml-org#16409)
ggml : fix graph reallocation with multiple chunks (ggml-org#16396)
Fix missing messages on sibling navigation (ggml-org#16408)
vulkan: Replace uses of maxMemoryAllocationSize and VK_WHOLE_SIZE (ggml-org#16354)
vulkan: Fix FA coopmat1 invalid array indexing (ggml-org#16365)
ci : change macos-13 to macos-15-intel (ggml-org#16401)
Capture model name only after first token (streaming) or completed request (ggml-org#16405)
vulkan: in flash attention, bounds check against nem1 (don't rely on GGML_KQ_MASK_PAD) (ggml-org#16316)
webui : Fix messages payload sent to chat completions (ggml-org#16402)
fix: track viewportHeight via window.innerHeight to avoid unwanted scrolling (ggml-org#16356)
test-barrier : do not use more threads than physically available (ggml-org#16389)
ggml webgpu: add support for soft_max, optimize rms_norm (ggml-org#16357)
model : Apertus model implementation (ggml-org#15852)
musa: update compile flags (ggml-org#16265)
ci : fix ubuntu-latest-cmake-rpc (disable ccache) (ggml-org#16388)
ci: update vulkan ci (ggml-org#16294)
ci : fix clean-up of old logs (ggml-org#16381)
SYCL: Update to oneAPI 2025.2 (ggml-org#16371)
HIP: add IMbackK to codeowner (ggml-org#16375)
...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
devops improvements to build systems and github actions
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants